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Background noise in wind tunnel environments poses a challenge to acoustic measurements due to 
possible low or negative Signal to Noise Ratios (SNRs) present in the testing environment. This paper 
overviews the application of time domain Adaptive Noise Cancellation (ANC) to microphone array signals 
with an intended application of background noise reduction in wind tunnels. An experiment was conducted to 
simulate background noise from a wind tunnel circuit measured by an out-of-flow microphone array in the 
tunnel test section. A reference microphone was used to acquire a background noise signal which interfered 
with the desired primary noise source signal at the array. The technique’s efficacy was investigated using 
frequency spectra from the array microphones, array beamforming of the point source region, and 
subsequent deconvolution using the Deconvolution Approach for the Mapping of Acoustic Sources (DAMAS) 
algorithm. Comparisons were made with the conventional techniques for improving SNR of spectral and 
Cross-Spectral Matrix subtraction. The method was seen to recover the primary signal level in SNRs as low 
as -29 dB and outperform the conventional methods. A second processing approach using the center array 
microphone as the noise reference was investigated for more general applicability of the ANC technique. It 
outperformed the conventional methods at the -29 dB SNR but yielded less accurate results when coherence 
over the array dropped. This approach could possibly improve conventional testing methodology but must be 
investigated further under more realistic testing conditions. 
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Nomenclature 

Finite Impulse Response filter coefficients at step n 
steering vector for array to focus location 
component of e for microphone m 
frequency 

frequency bandwidth resolution of spectra 
cross-spectrum between p m and p m • 
matrix (CSM) of cross-spectral elements G mm ’ 
counting number of CSM averages 
microphone identity number in array 
same as m , but independently varied 
total number of array microphones 
background noise present in primary ANC input 
background noise present in reference ANC input 
total number of FIR filter weights 
pressure time records from microphone m 
Fourier Transform of p m 

distance r m for m being the center array microphone 

coordinate distance to m 

signal from primary sound source 
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output power response of the array at focus location 

data block length 

complex transpose (superscript) 

data window weighting constant 

reference input at step n 

FIR filter output at step n 

ANC output at step n 

coherence 

ANC error at step n 

LMS algorithm step size 

time delay 

propagation time from grid point to microphone m 


I. Introduction 

N ORMALLY designed wind tunnels are not optimal for acoustics testing because they are designed with only 
aerodynamic considerations in mind. The acoustic measurements of components placed for testing in the 
tunnels are often masked by background noise generated by the wind tunnel flow drive or the flow itself, resulting in 
low or negative SNRs. This background noise limits the accurate measurement, identification, and quantification of 
noise sources under study. Thus, reduction of background noise to improve the SNR in testing environments is of 
interest. 

In wind tunnels, techniques have been investigated that remove noise contamination from source noise 
measurements [1]. One technique used the assumption that if the background noise at two different transducers was 
uncorrelated yet statistically equal, the frequency spectrum due to only the model source could be recovered from 
the difference between the two time-series. Improvements were made on the technique over the next two decades [2- 
5]. In 1989, a technique for capturing model noise from turbulent boundary layer wall-pressure fluctuations was 
investigated [6]. The authors postulated that acoustic disturbances propagated down the wind tunnel acting as a 
waveguide. Turbulent pressure fluctuations due to local velocity disturbances were found to be uncorrelated to the 
acoustic disturbances at any streamwise location. Two microphones at the same streamwise location yet separated 
spanwise were used. One signal was delayed by a time constant x then subtracted from the other signal to form a 
new time-series. The true spectrum due to the model source was obtained from this time-series at frequencies 1/x 
and higher harmonics. It was concluded that some of the model source signal was cancelled along with the 
contaminating noise if the microphone spacing was insufficient. This approach was extended beyond the 
aforementioned frequency-domain studies to time-domain noise cancellation in 1996 [7]. The authors used an 
optimizing Finite Impulse Response (FIR) filter to estimate the correlated noise between a primary and reference 
signal then subtracted it from the reference signal. The turbulent pressure components in each signal were taken as 
uncorrelated as long as the transducers were separated by a sufficient distance [6]. Therefore, the correlated parts 
belonged to the common noise between the signals. This was then subtracted from the reference signal leaving just 
the source noise time-series with magnitude and phase intact. At low frequencies part of the source signal was 
cancelled along with the background noise if insufficient transducer spacing was used. It was shown that this 
undesired source signal cancellation [7] was an order of magnitude smaller than that removed using the 
aforementioned methods [1-6]. 

Early efforts in the study of time-domain Adaptive Noise Cancellation (ANC) were made in the 1960s [8, 9] 
and the technique was formalized in 1975 [10]. The method used a minimum of two channels: a reference channel 
intended to receive only undesired background noise and a primary channel containing the signal plus background 
noise. The reference time-series is fed into a filter which is adapted to produce a best estimate of the noise present in 
the primary time-series and then subtracted from the primary signal leaving a “cleaned” result. Since its publication 
the technique has been widely employed primarily for applications of noise reduction for improved speech 
recognition. Note that Adaptive Noise Cancellation is an electronic, in-wire signal cancellation technique, as 
opposed to Active Noise Cancellation which involves actual pressure fluctuation cancellation. 

An early, simplified experiment employing ANC reported at least 20 dB noise cancellation with minimum 
distortion [1 1]. Background noise in a fighter jet cockpit presented an important application of the technique that led 
to the identification of the major challenges for successful implementation. A reference microphone attached to the 
outside of the helmet was used to capture the ambient noise present in order to cancel it from the pilot’s microphone. 
An ideal simulation [12] of the cockpit environment reported an 11 dB SNR improvement. An important 
consideration discussed was acquisition of the desired signal by the reference channel, which would result in 
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cancellation of the desired signal at the output. This was circumvented by adapting the filter used for cancellation 
only during periods of background noise. A more realistic evaluation [13] of the cockpit environment was performed 
by simulating a diffuse background noise field and it was concluded that good cancellation was not possible at 
frequencies above 2 kHz. The study identified coherence between the reference and primary channels as the most 
important factor in successful cancellation. Increasing the distance between the transducers was cited as the cause 
for low coherence above 2 kHz. This was an effect of a diffuse sound field [14]. 

Experimental implementation of ANC appears to have conflicting requirements: good coherence is necessary 
for noise cancellation, necessitating the reference transducer be located either close to the noise source(s) (if 
localized) or close to the primary transducer(s) (if a diffuse noise field present). And if the noise source is not 
localizable, locating the reference transducer close to the primary may result in cancellation of the desired signal. 

In microphone array applications, background noise has been removed using a spectral subtraction technique 
since 1974 [15]. The method relies on a separate acquisition of the background noise in the testing environment 
from that when the noise source under study is present. Upon converting signals to the frequency domain, this 
“background only” acquisition is subtracted (on a pressure-squared basis) from that of the primary acquisition 
(signal plus noise). With the introduction of beamforming using microphone arrays, this same principle has been 
applied to Cross-Spectral Matrices (CSMs) since the late 1990s [16-20]. Both techniques rely on the cancellation of 
the noise in its statistical sense, since the background noise present will not be identical during two acquisitions 
separated in time. The CSM subtraction technique allows further noise reduction beyond what spectral subtraction 
can achieve on a per channel basis. This is because uncorrelated noise at distinct array microphones is already 
reduced by the cross-correlation performed when forming the CSM [21]. These techniques are compared herein to 
ANC in an experiment intended to provide simulated wind tunnel background noise being measured by an out-of- 
flow array in a wind tunnel test section. 


Primary 



II. Theory 

Adaptive Noise Cancellation. Noise cancellation in the time domain utilizes a reference input, ideally 
containing just noise, which is passed through an adaptive filter and later subtracted from a primary input containing 
both the desired signal and correlated 
components of the noise present in the 
reference input. The minimized output 
becomes the primary signal with the noise 
attenuated or cancelled altogether. The 
correlation between the noise present in the 
reference channel and the primary input is 
important: the higher it is, the better the 
cancellation [13]. A diagram of the concept is 
pictured in Fig. 1. 

In Fig. 1, a primary sound source (s) that 
contains uncorrelated noise (n 0 ) is transmitted 
to the upper left channel. The bottom left 
channel receives noise (nj) correlated with n 0 
in some unknown manner. It is fed into the 
adaptive filter with the goal of replicating n 0 . 

The output of the adaptive filter is subtracted 
from the primary input (s + n 0 ) to produce z, 
which also serves as the error signal to the 
filter. 

The interior of the dashed box of Fig. 1 is 
given in Fig. 2. Success of the adaptation is 
dependent on the causality of the system, i.e. 
the noise reference must be present in the 
system before the input. From Fig. 2, the 
reference signal (. x n ) is fed to the FIR filter at 
the start of the processing. The initial filter 
coefficients (< c n ) and number of weights are 
provided by the user. The output of the filter 


Adaptive Noise Canceller 


Figure 1. Adaptive noise canceller concept [10]. 



Figure 2. Adaptive noise canceller concept with LMS 
algorithm and FIR filter block diagrams included. 
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(y„) is subtracted from the primary input signal (s + n 0 , an array microphone) which contains a correlated version of 
the noise present in the reference signal from an earlier time. The error (s n ) is then fed into the Least Mean Squares 
(LMS) algorithm [10] and multiplied with the reference input (x n ) and the step size (ju) to produce the next set of 
filter coefficients (c n+1 ). The order of the FIR filter is the number of weights or “taps” used. FIR filters are always 
stable and have linear phase response given that the coefficients are symmetrical. 

The difference equation defining the output of the filter is 

Vn ~ "f fiAn-l T •“ + C N X n _ N (1) 

where N is the number of filter weights specified. As the noise to be cancelled is present in the reference signal 
before it is in the primary signal, the system is causal and successful adaptation will be achieved. This is seen in Eq. 
(1). As long as the number of filter weights is sufficient to encompass the time delay between the reference and 
primary signal (in number of samples), the noise present in the primary signal will be related to that seen in an 
earlier sample still present in y n . For proof of the concept see [10]. 

ANC will be implemented by removing the noise from each microphone array channel in the time domain. 
These “cleaned” signals will then be used in frequency domain processing to assess the effectiveness of the noise 
removal. 


Beamforming. Beamforming in the frequency domain utilizes phase shifts between the microphone array 
signals to “steer” the focus of the array and thus obtain more detailed sound pressure magnitude information at 
defined points in space. 

The first step in beamforming is to compute the Cross Spectral Matrix (CSM) for the data set to be analyzed 
[22]. The CSM is formed from the Fast Fourier Transforms (FFTs) of the data set. The FFTs of two microphones, m 
and m \ are denoted P m (f, T) and T) and are formed from the microphones’ time records p m (t) and p m ( t). 

The frequencies,/, at which the transforms are defined are determined by the bandwidth, A/= 1/T (Hz), where T is 
the data block length. A CSM element, as a function of frequency, is defined 


’(/) = ^£fc=l[ P mfc(/’ T )fVfc 


(/- T)] 


The total record length is T tot = TK. The summation is multiplied by two because it is a one-sided cross-spectrum. It 
is then divided by the data block length T to normalize the FFT output and the number of block averages K to get a 
mean value across the data blocks. The term is a weighting constant used when a weighted window is 
implemented. The star on the first transform denotes the complex conjugate. The CSM element is a complex 
spectrum. The full CSM for M array microphones is a Hermitian matrix and is given as 
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The beamforming uses the CSM to electronically “steer” to positions in space defined by the user. For the work 
done here, the array was one dimensional and the grid points were located along a line that bisected the sound 
source. In order to steer to grid points on the line, vectors must be calculated between each array microphone and the 
grid point being steered to. The steering vector for microphone m is 


e m = ( 7 1 ) e i2nfTm 


where r m is the straight line distance from microphone m to the grid point, r c denotes the straight line distance from 
the center microphone to the grid point, and z m is the propagation time for sound to travel between microphone m 

and the grid point. The term is included to normalize the distance dependent amplitude of the steering vector to 
that of the center array microphone. The steering vector matrix, size Mxl , is 


e = col [e ± e 2 
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The steering vectors, being complex, will phase shift the contributions from each microphone in the array to allow 
for constructive summing at the chosen grid point. 

Finally, the array’s response is given as an output power spectrum 


S(e ) = e -^ 

v y M 2 


( 6 ) 


The response has units of mean-pressure-squared vs. frequency bandwidth. Dividing by the total number of 
microphones squared normalizes the response to that of a single microphone level. The superscript T is taking the 
complex transpose of the steering vector matrix. 


DAMAS. The Deconvolution for the Mapping of Acoustic Sources [22] will be used to provide further detail of 
the sound source under study in the presence of noise. This technique uses the beamform response as a precursor 
and removes microphone array characteristics (point spread function) to provide an estimation of only the sound 
source strength that exists in the scanning region. The technique uses an iterative relaxation-type solver to provide 
non-ambiguous, enhanced resolution presentations of sound sources and is discussed in more detail in [22]. 


III. Experimental Setup 

A small anechoic chamber housed at NASA Langley Research Center (LaRC) was used for the test area. 
Approximate dimensions of the room are 97x77x81 inches (length/width/height), wedge tip-to-tip. 

Nine B&K quarter-inch 
free- field microphones were 
used to construct a linear 
1.16’’ 1.03’’ 1.19’" i.oo” 1.03’’ 0 .88 ” 1.06” 0 .94” array using metal support 

, rods with base stands, as 

5 9 9 9 9 9 9 9 9j shown in Fig. 3 a. The 

1 5 9 desired microphone spacing 

Figure 3. (a) Linear microphone array, and (b) Microphone spacing. was 1” between diaphragm 

centers; actual spacing 
differed slightly due to equipment 
limitations (see Fig. 3b for exact 
dimensions). 

Since the microphone array is 
linear and configured broadside 
towards the primary source, its 

sensing ability lies only along a line 
parallel with the array at the same 
vertical height. Thus, the array, 
source speaker, and background 
speaker were all placed in the same 
plane parallel to the floor, at a 
distance of 46.5” above the floor to 
ensure that the speaker was high 
enough above the wedges to avoid 
low frequency near-field reflections. 
Refer to Fig. 4 for a diagram of the 
setup. The source speaker was 
perpendicular to the array at a 
distance of 67”. The background 
speaker was at an angle of 59° (top 
view) to the array and a distance of 
43” from microphone 1. A reference 
microphone, labeled 10, was situated 
a distance of 5” away from the diaphragm center of the background speaker with its rear facing the source speaker to 
limit the source speaker’s influence on the microphone diaphragm. 



Figure 4. ANC test setup. 
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Figure 5. a) ANC setup picture with background speaker and reference microphone in the near-field and 
array in the far-field, and b) Point source speaker with copper extension tube installed. 

A Pioneer 5.25” Cup Midrange background speaker is pictured in the foreground of Fig. 5a. The reference 
microphone (10) is situated to its right and the microphone array is in the background. A custom source speaker with 
a Selenium Compression Driver was used as the point source (desired) and is pictured in Fig. 5b. The point source 
speaker was fitted with a copper extension tube to further simulate an ideal point source and a foam insert was 
placed in the end of the tube to reduce high frequency reflections. A pink noise generator was used to drive the point 
source speaker and a white noise generator was used to power the background speaker. 

IV. Results 

This section presents example test data that was processed utilizing the signal processing techniques discussed 
in section n. The frequency resolution for the data presentations is 20 Hz. The microphone signals were high-pass 
filtered at 400 Hz to avoid any low-frequency reflections present in the chamber. 

Table 1 gives parameters for the cases analyzed in this study in order to investigate the potential and limitations 
associated with ANC in microphone array applications. The first two use microphone 10 (see Fig. 4) as the reference 
channel to cancel noise at the array microphones. Microphone 10 gives the best estimate of the noise because of its 
proximity to the background speaker. The first case examines results at a low SNR. This results in high coherence 
between microphone 10 and the array microphones. The second case is for a higher SNR acquisition. The coherence 
between microphone 10 and the array microphones drops as a result of the increased SNR at the array. The third and 
fourth cases follow the same pattern. However, microphone 5 is used as the reference channel to cancel the noise on 
the array microphones. Microphone 5, being the center array microphone, gives an average of the noise sensed over 
the array. 


Table 1. Test Cases. 







Narrowband Coherence 

Case 

Acquisition 

Average SNR, 
0.5-9.5 kHz (dB) 

Narrowband 
F requency (kHz) 

Narrowband 
SNR (dB) 

Mic 10 - 
Array 

Mic 5 - 
Array 

1 

a 

-19.9 

6.3 

-29.2 

0.95 

- 

2 

b 

-6.5 

8.4 

-3.6 

0.43 

-- 

3 

a 

-19.9 

6.3 

-29.2 

- 

0.99 

4 

b 

-6.5 

8.4 

-3.6 

-- 

0.43 


Note that while the low SNR cases pose the largest signal recovery in decibels (dB), the high coherence 
between the reference and array microphones will be beneficial to ANC recovery. This is because attenuation as a 
function of coherence (y 2 ) [23, 24] is calculated as 

attenuation ( dB ) = 10 x log 10 (l - y 2 ) (7) 

For each case, the noise attenuation results using ANC will be compared to spectral subtraction using 
autospectra. Then, array processing techniques (beamforming and DAMAS, see section II) will be used to compare 
the noise attenuation results from ANC with CSM subtraction. 
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Case 1. Figure 6 displays autospectra from the center array microphone. For all cases, the sampling frequency 
used was 20 kHz. The digital antialiasing filter on the data acquisition system was set to 9.7 kHz; the low-pass filter 
rolls off sharply: by the Nyquist frequency (10 kHz) the reduction is -10 dB, and by 10.5 kHz the reduction falls to - 
50 dB with zero phase error. The drop-off due to the low-pass filter will be seen in the subsequent autospectra. Note 
that the data are plotted as power spectra and thus specific to the acquisition parameters. The blue line corresponds 
to the acquisition when both the point source and background speakers were operated simultaneously. The average 
SNR between the two over the frequency range 0.5-9. 5 kHz is -19.9 dB. The noise floor (all equipment off) of the 
room is plotted in light blue for reference. The desired outcome of the noise attenuation processing is to recover the 
point source spectrum (red) from the combined spectrum (source plus noise, blue). 

The noise attenuation results from spectral subtraction (see section I) are shown in black. The choppiness of the 
signal is due to the 20 Hz bandwidth. Large dips in the signal correspond to frequencies at which the value of the 
acquisition of just the background noise speaker operating (not shown) approaches that of the combined acquisition 
(blue). It is seen that that an average attenuation from the combined spectrum of 1 1 dB is achieved (0.5-9. 5 kHz). 
Again, the point source spectrum (red) is the signal to recover. Even with the achieved attenuation, the spectral 
subtraction data are still 6 dB or more above the point source spectrum. 

The noise attenuation data using ANC with microphone 10 as the background noise reference are plotted in 
bright green. The results give almost perfect attenuation across the 0.5-9. 5 kHz frequency band, providing an 
excellent recovery of the point source signal. This is due to two factors: i) high coherence present between the 
reference and array microphones, and ii) the reference microphone being dominated by the background noise due to 
its proximity to the background speaker. The second factor allows the reference microphone to obtain a very good 
estimate of the background noise signal free of contamination from the point source signal. This is important 
because microphone 10 is not shielded from point source speaker radiation. It experiences similar levels to the array 
microphones when just the point source is operated. When both the point source and background speakers are 
operated, microphone 10’s level is an average of 20 dB greater than the array microphones, which yields almost 40 
dB greater (0.5-9. 5 kHz) than the point source alone for this case. This difference in level effectively “shields” 
microphone 10 from point source influence. 



Figure 6. Center array microphone autospectra for Case 1. 


Referencing section II, beamforming is an array technique which calculates sound pressure magnitude at 
distinct points in space. The points in space are user defined with a grid. The grid location is chosen to provide 
details of the sound source under study, in this case the point source speaker. Thus, a 4 ft horizontal line at the height 
of the point source speaker with its center at the speaker location was chosen as the one dimensional scanning grid 
over which to perform the beamforming. Note that beamforming as a technique already attenuates noise as 
uncorrelated terms in the CSM tend to zero in the statistical limit [21]. Therefore, beamforming offers SNR 
improvements over single microphone spectra, and thus CSM subtraction an improvement over spectral subtraction. 
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Figure 7 gives the beamform responses at 6.3 kHz. Criteria from [22] for a reasonable resolution range were 
used. The color conventions were kept the same as Fig. 6 for continuity and are used throughout. The desired 
outcome of the noise attenuation processing is to recover the point source beamform (red) from the point source plus 
background noise beamform (blue). The combined point source and background speaker acquisition beamform is 
clearly distorted. At almost 30 dB down (Fig. 6), the small (9 microphone) array has difficulty recovering the point 
source strength (x = 0 ft) or lobe structure from the combined acquisition. Note that a larger array would have more 
accurate desired signal recovery under the same conditions due to the array SNR gain [21] 

array gain ( dB ) = 10 x log 10 (# of array micr op hones) (8) 

The black line corresponds to the noise attenuated beamform obtained through CSM subtraction (see sections I 
and II). While it matches the point source level (slightly to the right of x = 0 ft), its distorted sidelobe structure is 
similar to that of the combined acquisition. The green line gives the noise attenuated beamform using ANC with 
microphone 10 as the reference signal. It matches the point source beamform well at the source location (x = 0 ft) 
and has similar sidelobe structure, indicating that the point source magnitude is recovered and phase preserved 
correctly during the ANC process. 



Deconvolution using DAMAS [22] removes microphone array characteristics from the beamform response to 
provide an estimation of only the sound source strength that exists in the scanning grid. For instance, if the point 
source speaker were a dot in space at x = 0 ft operating at 79 dB, an ideal DAMAS deconvolution would converge 
through an iterative process on one point at x = 0 ft with a level of 79 dB. All the rest of the grid point levels would 
tend towards negative infinity, because the point source speaker is the only sound source located in the scanning 
region. This deconvolved beamform would represent an enhanced resolution sound map indicating the exact 
location in the scanning grid and sound pressure level (SPL) of the point source speaker. 

Referring to the beamform responses given in Fig. 7, a resolution range for the DAMAS results was chosen to 
be 65-89 dB. Note that experimental results will not fall to 0 dB where no sound sources exist in the beamform grid. 
This lower SPL bound will illustrate that the deconvolution eliminates the sidelobe structure present in the 
beamform responses. This presentation choice will be used throughout. 

Figure 8 displays the DAMAS results processed from the beamform responses of Fig. 7. Convergence was 
reached after 45 iterations. The goal of the noise attenuation processing is to recover the point source deconvolution 
map from the combined point source and background data. The combined map displays four “source” peaks over the 
grid. All are erroneous in location and level. This can be stated because the point source map identifies the correct 
location and level of the point source speaker, the only sound source located in the scanning grid. The deconvolved 
map from the beamform using CSM subtraction also fails to locate the point source speaker in location or level. The 
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DAMAS results from the ANC beamform response converge on a deconvolved map that correctly locates the point 
source speaker location and estimates it strength to within 0.3 dB. 



Grid Line, ft 

Figure 8. DAMAS results for Case 1 (6.3 kHz, SNR = -29.2 dB). 

Case 2. Figure 9 presents the autospectra from the center array microphone. The average SNR between 
combined and point source spectrum over the frequency range 0.5-9. 5 kHz is -6.5 dB. The spectral subtraction 
results are seen to recover the point source spectrum more accurately due to the higher SNR (with respect to Case 1). 
Loss of recovery accuracy occurs at frequencies where the SNR decreases. The ANC results using microphone 10 
again give almost perfect attenuation across the 0.5-9. 5 kHz frequency band. Note that at the frequency chosen for 
narrowband analysis (8.4 kHz), the average coherence between microphone 10 and the array microphones is 0.43. 
Even with this low coherence the ANC recovered spectrum is within 0.1 dB of the point source level. This is 
attributed to the relatively high SNR and again because microphone 10’s signal is dominated by the background 
noise, due to its proximity to the background speaker, which provides an accurate estimate of the noise to be 
cancelled from the array signals. 



Figure 9. Center array microphone autospectra for Case 2. 
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Figure 10 gives the beamforming results at 8.4 kHz for Case 2. Due to the increased SNR (with respect to Case 
1), the beamform of the combined acquisition matches the point source only results at the source position (x = 0 ft) 
and has similar sidelobe structure. This is achieved without any noise attenuation processing and provides evidence 
that beamforming as a technique attenuates noise due to uncorrelated noise being cancelled upon forming the CSM. 
Both the processed, noise attenuated beamform responses, CSM subtraction and ANC using microphone 10, match 
the point source level at the speaker location and have very similar sidelobe structures. At this SNR, both methods 
are effective at recovering the point source beamform response. The low coherence (0.43) present at this frequency 
did not hinder the ANC recovery. 



Figure 10. Beamform results for Case 2 (8.4 kHz, SNR = -3.6 dB). 


Figure 11 displays the DAM AS results processed from the beamform responses of Fig. 10. Convergence was 
reached after 58 iterations. Although the combined map displays various low-level “sources” they are 14 dB or more 
below the point source level (x = 0 ft). Its largest peak is located accurately at the point source speaker location and 
it estimates the source strength to within 0.7 dB. The DAM AS results from the CSM subtraction and ANC 
beamform responses converge at the correct point source speaker location and estimate its level to within 0.1 dB. 
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Figure 11. DAMAS results for Case 2 (8.4 kHz, SNR = -3.6 dB). 
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Cases 3 and 4 take a difference approach to the ANC processing due to the fact that array microphone 5 is used 
as the background noise reference signal. As microphone 5 is directly exposed to the point source speaker radiation 
and records levels on the order of the other array mics, its signal should not be adapted during the noise removal 
filtering. This is due to the concern that the desired signal will be cancelled along with the noise from the channels 
being processed with microphone 5 as the noise reference [12], because they are both exposed to point source 
radiation at similar magnitudes (contrast this with microphone 10). Thus, the ANC filter weights are converged 
using the acquisition of only the background noise speaker operating, then saved to process (with zero adaptation, p 
= 0) the signals obtained during the combined (point source plus background noise) acquisition. If the background 
noise were exactly repeatable, using the saved weights from background noise acquisition processing and zero 
adaptation would achieve exact cancellation of the noise. This is not the case as random white noise was used as the 
background noise input. The ANC filter adapts its weights based on time-domain transducer signals. While the 
background noise from one acquisition to the next is spectrally similar, no two randomly generated time signals are 
the same. Thus, the weights used from the background only acquisition will be good estimates of the weights 
required for background noise cancellation in the combined case, but not exact. As adaptation is frozen (p = 0, 
weights are not updated), error is introduced in recovery. It is noted that time-domain cancellation, in the limit, will 
always perform better than spectral cancellation because it is exactly cancelling the noise at each time step in the 
processing. Spectral cancellation relies on statistical approximation and therefore slight variations in the noise are 
not accounted for and must be averaged to produce an estimate of the noise. 

Case 3. Figure 12 presents the array averaged autospectra for this case. The data are from the same acquisition 
as Case 1. For Cases 3 and 4 the autospectra are array averaged. The same spectral subtraction performance from 
Case 1 is seen here: an average attenuation of 1 1 dB is achieved but the data are still 6 dB or above the point source 
spectrum (0.5-9. 5 kHz). 

The noise attenuation data using ANC with microphone 5 as the background noise reference are plotted in 
bright green. The results give significant attenuation across the 0.5-9. 5 kHz frequency band, but are an average 3 dB 
above the point source spectrum (the desired signal to recover). Although the point source autospectrum recovery is 
not as accurate using microphone 5 as the noise reference (with respect to microphone 10), its performance is 
significantly better than spectral subtraction for this low SNR acquisition. Again, note that this recovery was 
obtained without adaptation. Only the saved filter weights from adapting the signals with the background only 
acquisition data were used in the processing. This procedure is similar to establishing a transfer function of the 
background noise between the reference and array microphones, then using this “transfer function” to remove noise 
from combined acquisition data. And in this case, as one of the array microphones was used as a reference, no other 
“reference” signals are needed (contrast with methodology from Cases 1 and 2 where a separately located reference 
microphone is used). 
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Figure 13 gives the beamform responses for this case at 6.3 kHz. Note because microphone 5 is used as 
reference it cannot be used as an array microphone for beamforming. This is because another array microphone 
would have to be used as a reference to attenuate the noise on microphone 5. This would lead to slightly different 
results not consistent with the other array microphones, because slightly different noise would be attenuated. 
Therefore, only array microphones 1-4 and 6-9 were used for beamforming and subsequent DAM AS processing. 
This applies to both Case 3 and 4. The level and shape discrepancies between the beamform responses from Cases 1 
& 2 vs. 3 & 4 are due to this. The combined point source and background speaker acquisition beamform, using now 
only 8 microphones, is even more distorted than that of Fig. 7. This is noted here because these are the same 
acquisitions. The spectral subtraction beamforming achieves a 7 dB reduction at the point source location (x = 0 ft) 
but clearly fails at recovering the point source response. The ANC beamform response using microphone 5 recovers 
the point source level at the speaker location to within 1 dB and has a similar sidelobe structure. The conclusion 
drawn from Fig. 12 holds here: while ANC recovery using microphone 5 as reference is not as accurate as using 
microphone 10, at this SNR it outperforms CSM subtraction and recovers the point source strength to within 1 dB. 




Grid Line, ft 


Figure 14. DAMAS results for Case 3 (6.3 kHz, SNR= -29.2 dB). 
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Figure 14 gives the corresponding deconvolved sound map for Fig. 13. Convergence was reached after 45 
iterations. The combined map displays four erroneous “source” peaks over the grid. The CSM subtraction results 
display three. All erroneous peaks are greater than the point source level. The deconvolved map using ANC with 
microphone 5 as reference correctly identifies the point source speaker location and estimates its strength to within 
0.1 dB. Note that compared to Case 1, Case 3 ANC processing yielded less accurate autospectra and beamforming 
results. However, the ANC beamforming results for Case 3 were accurate enough to yield the same accuracy in 
deconvolution as Case 1 ANC results (compare Figs. 8 and 14). 

Case 4. This final case employs the same processing technique as Case 3 on the data from the higher SNR and 
lower coherence acquisition. 

Figure 15 presents the array averaged autospectra for this case. The data are from the same acquisition as Case 
2. The same spectral subtraction performance from Case 2 is repeated: the point source spectrum is recovered 
accurately where SNRs are higher, and loses accuracy with decreasing SNR. The ANC processing recovers the point 
source data to within 2 dB across most of the 0.5-9. 5 kHz frequency range. The ANC processing outperforms 
spectral subtraction accuracy wise. As with Case 3 compared to 1, the Case 4 ANC recovery is not as accurate as the 
ANC recovery of Case 2. 



Figure 16 presents the Case 4 beamform responses. Due to the high SNR, the beamforming is able to accurately 
recover the point source spectrum from the combined acquisition without any noise attenuation processing. CSM 
subtraction further improves the recovery accuracy. ANC processing is even less accurate than the combined 
acquisition beamform response. This is due to the low coherence (0.43) present between microphone 5 and the rest 
of the array channels at this frequency and because filter weight adaptation was frozen. 

The deconvolved beamform maps are given in Fig. 17. Although the combined acquisition DAM AS result 
contains two erroneous peaks (x = -1.9, 0.7), they are more than 14 dB from the correct point source level at x = 0 ft. 
It estimates the point source strength to within 1.6 dB with no noise attenuation processing. The beamform (Fig. 16) 
and its deconvolution display the noise attenuation capabilities of these array processing techniques at this SNR. The 
deconvolved map using CSM subtraction accurately predicts the point source location and level to within 0.02 dB. 
This is excellent recovery at this SNR. As with the beamform response (Fig. 16), the ANC results are the least 
accurate. The predicted point source speaker location is 0.4” to the right of the true location. Its level was predicted 
2.8 dB higher than the actual. 
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Figure 16. Beamform results for Case 4 (8.4 kHz, SNR = -3.6 dB). 
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Figure 17. DAMAS results for Case 4 (8.4 kHz, SNR = -3.6 dB). 
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The Case 4 results demonstrate that at frequencies with relatively high SNRs the spectral and CSM subtraction 
methods provide accurate recovery of the point source level. The array technique of beamforming (without noise 
attenuation processing) is able to recover the point source magnitude and lobe structure. Subsequent deconvolution 
using DAMAS correctly estimated the point source speaker location with little error (Fig. 17). CSM subtraction only 
improved the accuracy of the beamform response and corresponding deconvolution map. Due to low coherence at 
the narrowband frequency chosen, the ANC results without adaptation using the center array microphone were less 
accurate than those with higher coherence (Case 3), and using microphone 10 as reference with adaptation (Case 2). 
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V. Conclusions 

The application of Adaptive Noise Cancellation with microphone array signals has been investigated in a 
simplified setup using a background noise and a point source speaker. The background speaker was intended to 
produce average Signal-to-Noise Ratios across the frequency band 0.5-9. 5 kHz as low as -20 dB at the array. A 
reference microphone was located close to the background noise speaker in order to obtain a nearly ideal acquisition 
of the background noise. 

Two cases investigated the noise attenuation processing at a low SNR and high coherence between the reference 
and array microphones. Spectral subtraction was insufficient in recovering the point source magnitude over the 
frequency band analyzed. CSM subtraction also failed at recovering a beamform response accurate enough to yield 
convergence on the correct point source location and magnitude in subsequent deconvolution using DAM AS. ANC 
processing using a reference microphone close to the background noise source proved highly accurate in recovering 
the point source magnitude over the frequency band, beamform response, and corresponding deconvolution map. 
The ANC processing using the center array microphone as a noise reference and no filter adaptation was also 
successful in the recovery of the point source magnitude and location, but yielded less accurate results than those 
processed using the reference microphone close to the background noise source. 

Two additional cases analyzed the noise attenuation processing performance under relatively higher SNR and 
lower coherence conditions. Spectral subtraction yielded accurate recovery of the point source magnitude at 
frequencies where high SNR was present. Beamforming of the signal plus noise acquisition was sufficient to recover 
the point source magnitude and location in the deconvolved beamform maps for these cases due to the higher SNR. 
CSM subtraction only improved this performance. The ANC results using the microphone placed near the 
background noise source yielded the best results of any of the processing methods. The low coherence did not 
reduce its recovery accuracy because of its ideal background noise estimate. ANC using the center array microphone 
yielded the least accurate results due to the low coherence present between the center and array microphones and 
because the filter adaptation was frozen. However, its recovery was sufficient to locate and predict the point source 
magnitude in its deconvolved beamform map. 

The ANC technique using a reference microphone placed near the background noise source would be effective 
for localizable noise sources which retain coherence between the reference and array microphones. Using the center 
array microphone as a reference to attenuate the noise on the other array channels with ANC has more general 
applications and the possibility of improving upon the current noise attenuation methods. Analysis of the techniques 
with a more populated microphone array under more realistic test conditions remains to be performed. 
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